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Abstract 

We study the power control problem in single-hop wireless ad hoc networks with selfish users. 
Without incentive schemes, selfish users tend to transmit at their maximum power levels, causing excessive 
interference to each other In this paper, we study a class of incentive schemes based on intervention 
to induce selfish users to transmit at desired power levels. In a power control scenario, an intervention 
scheme can be implemented by introducing an intervention device that can monitor the power levels 
of users and then transmit power to cause interference to users if necessary. Focusing on first-order 
intervention rules based on individual transmit powers, we derive conditions on the intervention rates 
and the power budget to achieve a desired outcome as a (unique) Nash equilibrium with intervention and 
propose a dynamic adjustment process to guide users and the intervention device to the desired outcome. 
We also analyze the effect of using aggregate receive power instead of individual transmit powers. Our 
results show that intervention schemes can be designed to achieve any positive power profile while using 
interference from the intervention device only as a threat. Lastly, simulation results are presented to 
illustrate the performance improvement from using intervention schemes and the theoretical results. 

Index Terms 

Game theory, incentives, intervention, power control, wireless networks 

I. Introduction 

Power control is an essential resource allocation scheme to control signal-to-interference-and-noise 
ratios (SINRs) for efficient transmission in wireless networks. Extensive studies have been done on 
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power control (see ||T] and references therein for an overview of the literature in this topic). In many 
earlier works on power control, each user has a fixed minimum SINR requirement and then minimizes its 
transmit power subject to the SINR requirement lH] Ch. 2]||2]|l3l. This approach is suitable for fixed-rate 
communications with voice applications. However, with the growing importance of data and multimedia 
applications, users are no longer satisfied with a fixed SINR requirement, but they seek to maximize their 
utility reflecting the quality of service (QoS). To this end, most recent works formulate the problem in 
the network utility maximization framework. In this framework, a central controller can compute optimal 
transmit power levels when the utility functions are such that the network utility maximization problem 
is convex, and then assigns the optimal power levels to users. Assuming that users are obedient to the 
central controller, the problem can also be solved in a distributed manner |[T] Ch. 4]||4l||5l. 

Besides the network utility maximization framework, many works use noncooperative games to model 
the distributed power control problem, in which each user maximizes its own utility, instead of maximizing 
the network utility. In a noncooperative game model with a single frequency channel, each user tends to 
transmit at its maximum power level to obtain high throughput, causing excessive interference to other 
users. This outcome may be far from the global optimality of social welfare IDEllIll, especially when 
interference among users is strong iQ. To improve the noncooperative outcome, various power control 
schemes have been proposed based on pricing ll8l- |[T2l . auctions |[T3l . and mechanism design |[T4l|[T5l . 
These works aim to achieve a better outcome by modifying the objective functions of users using taxation 
and developing a distributed method based on the optimization of the modified objective functions. Users 
are assumed to be obedient in that they accept the objective functions and follow the rule prescribed by 
the designer, and prices are used as control signals to guide users to a desired outcome. However, selfish 
users may have their own innate objectives which are different from the assigned objectives and may 
ignore control signals and deviate from the prescribed rule if they are better off by doing so. 

In summary, the methods in most existing works are not suitable for power control with selfish users. 
Selfish behavior of users can arise in many practical scenarios without central controllers, such as wireless 
ad hoc networks, where each user transmits information from its own transmitter to its own receiver, 
and multi-cell cellular networks, where the base station cannot control the interfering mobile stations 
in other cells. Hence, it is important to design an incentive scheme to induce selfish users to achieve 
a desirable outcome in power control scenarios. One method to provide incentives for selfish users is 
to impose taxation as real money payment. However, in order to achieve a desired outcome with a 
pricing scheme, the designer needs to know how payment affects the payoffs of users, which is often 
the private information of users. The designer may use a mechanism design approach as in |[T4l|[T5l to 
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elicit private information, but it generally requires heavy communication overheads. Another method 
to provide incentives is to use repeated games llT6lllT7l . However, effective incentive schemes based on 
repeated games require users to have long-run frequent interactions and to be sufficiently patient |[T8l . 

Recently, a new class of incentive schemes has been proposed based on the idea of intervention [19il - 
II2T1 . To implement an intervention scheme, we need an intervention device that can monitor the actions of 
users and intervene in their interaction if necessary. The monitoring technology of the intervention device 
determines what it can observe about the actions of users, while its intervention capability determines 
the extent to which it can intervene in their interaction. An intervention rule prescribes the action that the 
intervention device should take as a function of its observation. Among existing works on intervention 
schemes, |[T9ll and |[20l applied intervention schemes to contention games in the medium access control 
(MAC) layer, while ETI studied the impact of the monitoring technology and the intervention capability 
on the system performance in an abstract model. We also note that |[22l proposed a packet-dropping 
mechanism for queueing games using an idea similar to intervention. In this paper, we focus on a power 
control scenario and study intervention schemes in this particular scenario. 

In the power control scenario considered in this paper, the intervention device estimates the individual 
transmit power of each user or the aggregate receive power at its receiver and then transmits at a certain 
power level following the intervention rule prescribed by the designer. In order to achieve a target 
operating point, the designer can use an intervention rule such that the intervention device transmits 
minimum, possibly zero, power if users are transmitting at the desired power levels, while transmitting 
at a high power level to reduce the SINRs of users if a deviation is detected. In this way, an intervention 
scheme can punish the misbehavior of users and regulate the power transmission of selfish users. We 
first consider a monitoring technology with which the intervention device can estimate the individual 
transmit power of each user without errors. While focusing on a simple class of intervention rules called 
first-order intervention rules, we study the requirements for the parameters of first-order intervention 
rules to achieve a given target power profile as a (unique) Nash equilibrium (NE). We propose a 
dynamic adjustment process that the designer can use to guide users to the target power profile through 
intermediate targets. We then relax the monitoring requirement and consider a monitoring technology 
with which the intervention device can estimate only the aggregate receive power We show that with 
aggregate observation, intervention rules can be designed to achieve a given target as a NE but rarely as 

'Another drawback of II14I and II15I is the assumption that each user's utility function is jointly concave in all the users' 
power levels, which seems to be unrealistic in power control scenarios. 
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a unique NE. Our results provide a systematic design principle based on which a designer can choose 
an intervention scheme (an intervention device and an intervention rule) to achieve a desired outcome. 
Our analysis suggests that, unlike pricing schemes, it is possible for the designer to design effective 
intervention schemes without having knowledge about how users value their SINRs, as long as their utility 
is monotonically increasing with their own SINRs. We also propose a method based on intervention for 
the designer to estimate the cross channel gains, the noise powers, and maximum transmit power levels of 
users without any cooperative behavior of users such as sending pilot signals for channel estimation and 
reporting the estimates to the designer. After obtaining relevant information, the designer can configure 
an intervention rule to achieve a target operating point as NE. 

The rest of the paper is organized as follows. In Section JIJ we describe the system model of power 
control with intervention. In Section JIlJ we propose design criteria for intervention rules, performance 
characteristics to evaluate intervention rules, and classes of intervention rules. In Section JV] we study 
the design of first-order intervention rules to achieve a target power profile. In Section |Vl we discuss 
implementation issues related to intervention. Simulation results are presented in Section |Vl] Finally, 
Section IVIII concludes the paper. For the ease of reference, we summarize major notation used in this 
paper in Table H 

II. Model of Power Control With Intervention 

We consider a single-hop wireless ad hoc network, where a fixed set of N users and an intervention 
device transmit in a single frequency channel. The set of the users is denoted by TV = {1, 2, ... , N}. Each 
user has a transmitter and a receiver. Each user i chooses its transmit power pi in the set Vi = [0, Pj], where 
Pi > for all i G M. The power profile of all the users is denoted by p = (pi, . . . ,pAr) G P = Hi^i 
and the power profile of all the users other than user i is denoted by p_j. 

In the network, there is an intervention device, sometimes referred to as user 0, that consists of a 
transmitter and a receiver. The receiver of the intervention device can monitor the power profile of the 
users, while the transmitter can create interference to the users by transmitting power. Once the users 
choose their power profile, the intervention device obtains a signal y € y, where y is the set of all 
possible signals. We assume that the signal is realized deterministically given a power profile and use the 
signal determination function p : "P — > 3^ to denote the signal given the power profile p? After observing 

^More generally, we can assume that the signal is realized randomly given a power profile and use p(p) to represent the 
probability distribution of signals given p. We leave the analysis of this general case for future research. 
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TABLE I 
Summary of Major Notation. 



Symbol 


Description 


N 


Number of (regular) users in the network 


M 


Set of users, J\f={l,...,N} 


P. 


Maximum transmit power level of user i\ i £ M 


Po 


Power budget of the intervention device, or the intervention capability 




User i's action space Vi = [0, Pi]; i € Af U {0} 


Pi 


Transmit power level of user i, Pi £ Pt; i £ J\f U {0} 


P 


Power profile of the users, p = (pi, . . . ,Pn) € P — Yii'-i Pi 


P* 


Target power profile, p* = {pi, . . . ,Piv) 


P 


Profile of the maximum power levels, P = (Pi, . . . , Pjv) 


hij 


Channel gain from user j's transmitter to user i's receiver; i,j G TV U {0} 


Hi 


Noise power at user i's receiver 


li 


SINR at user j's receiver 


y 


Set of all possible signals obtained by the intervention device 


p 


Signal determination function, p : P ^ y 


f 


Intervention rule, f : y ^ Po 


£(f) 


Set of power profiles sustained by / 




Set of A'th-order intervention rules with target power profile p* 


PBk(j>1 


Minimum power budget to sustain p* using A'th-order intervention rules 


PBU^n 


Minimum power budget to strongly sustain p* using A'th-order intervention rules 


fl 


First-order intervention rule based on individual transmit powers 


a,: 


Intervention rate for user i in fl; i £ M 




First-order intervention rule based on aggregate receive power 




Aggregate intervention rate in fi 




Set of users whose target powers are less than the maximum power levels, A/'(p*) = {i £ N : p* < Pi} 


N' 


Number of users whose target powers are less than the maximum power levels, A^' = |A/'(p*)| 


{(/',p')}r=i 


Sequence of intervention rules and power profiles in the dynamic adjustment process 


{pnr=i 


Sequence of intermediate target power profiles in the dynamic adjustment process 


d(p,p') 


Relative distance between two power profiles p and p', c!(p,p') = ^Vi)lvi 


T*(p*) 


Minimum convergence time (in steps) for the dynamic adjustment process to reach p* 



January 19, 2013 



DRAFT 



6 



a signal, the intervention device chooses its own transmit power pQ in the set Vq = [0, Pq], where Pq > 0. 
We call {y, p) the monitoring technology of the intervention device, and call Pq its intervention capability. 
The ability of an intervention device is characterized by its monitoring technology and intervention 
capability. We call the transmit powers of the intervention device and the users (pO)P) ^ Vq V w 
outcome. 

The QoS obtained by user i is determined by its SINR, denoted by 7^. We use a block-fading channel 
model with sufficiently long fading blocks, as in ll2l-ll5l.ir7l- |[T7l . In one block, for i,j G A/^U{0}, let hij 
be the channel gain from user j's transmitter to user i's receiver, and let be the noise power at user i's 
receiver If code-division multiple accessing (CDMA) is used, the channel gain is defined as the effective 
channel gain taking into account the spreading factor In one block, the users and the intervention device 
transmit at constant power levels p and pQ, respectively.^ Then the SINR of user i € 7\A is given by 

7»(Po,p) = T r : ■ (1) 

We assume that each user i G Af has monotonic preferences on its own SINR in the sense that it weakly 
prefers to 7^' if and only if 7j > 7.. Our analysis does not require any other properties of preferences 
(for example, preferences do not need to be represented by a concave utility function).^ 

In our setting, the intervention device has a receiver to measure the aggregate receive power from all 
the users. Furthermore, if the receiver moves and takes measurement at different locations, it can estimate 
the individual transmit power of each user as well. Thus, in this paper we will focus on two types of 
monitoring technology with which the intervention device can estimate individual transmit powers p or 
an aggregate receive power J2^=i ^OiPi- In other words, we consider two signal determination functions, 
P{p) = P and p{p) = hoiPi. 

^In practice, there is a time lag between when the users transmit and when the intervention device transmits because the 
intervention has to monitor the users' power profile in order to decide its transmit power level. In this paper, we assume that 
this time lag is short and negligible compared to the length of fading blocks, although in principle we can take into account the 
time lag in the users' utility functions. See, for example, 1201 for a model that takes into account the time lag explicitly. 

''Throughout the paper, we use j ^ i with the summation operator to mean j £ Af\ {i}, not j G U {0} \ {i}. 

^The preferences of a user may be defined on dimensions other than its SINR. For example, a user may use the ratio of SINR 
to transmit power as the utility function, as in ||9), because it may care about its energy consumption as well. Our approach can 
be applied to such a scenario if utility functions representing the users' preferences are known to the designer. 
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III. Design of Intervention Rules 

A. Design Criteria 

Since the intervention device transmits its power after it obtains a signal, its strategy can be represented 
by a mapping f y ^ Vo, which is called an intervention rule. The SINR of user i when the intervention 
device uses an intervention rule / and the users choose a power profile p is given by 7j(/(/o(p)), p). With 
an abuse of notation, we will use /(p) to mean f{p{p)). Given an intervention rule /, the interaction 
among the users that choose their own power levels selfishly can be modeled as a non-cooperative game, 
whose strategic form is given by 

rj = (AA,(nW,(7i(/(-),-)W). (2) 

We can predict the power profile chosen by the users given an intervention rule using the concept of 
Nash equilibrium. 

Definition 1: A power profile p* € "P is a Nash equilibrium of the game Tf if 

7i(/(p*),P*) > iiif{m,P-i),m,pU) (3) 

for all Pi € Vi and all i G J\f. 

When a power profile p* is a NE of Tj, no user has the incentive to deviate from p* unilaterally 
provided that the intervention device uses intervention rule /. Moreover, if p* is a unique NE of Fj, 
intervention has robustness in that the designer does not need to worry about coordination failure (i.e., 
the possibility that the users get stuck in a "wrong" equilibrium). 

Definition 2: An intervention rule / (strongly) sustains a power profile p* if p* is a (unique) NE of 
the game Ff. 

We use £{f) to denote the set of all power profiles sustained by /. Suppose that the designer has a 
welfare function C/o(7i, . . . ,7Ar), defined on the users' SINRs. Then the objective of the designer is to 
find a target power profile that maximizes the welfare function and an intervention rule that (strongly) 
sustains the target power profile. Formally, the design problem solved by the designer can be written as 

maxmax{C/o(7i(/(p),p), ■ ■ ■ , 7Af (/(p), P)) : P € £{f)}- (4) 
P / 

If uniqueness is desired, we can replace p e £{f) in dUl with {p} = £"(/). Note that a solution to the 
design problem dD, p* and /*, must satisfy /*(p*) = 0, namely no intervention if the users choose the 
target power profile. Based on this observation, the design problem (UJl can be solved in two steps. In 
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the first step, we obtain a target power profile p* by solving 

max{C/o(7i(0,p),...,77v(0,p)) : p G P}. (5) 
p 

There always exists a solution to the optimization problem dSjl as long as the welfare function Uq is 
continuous in 71, ... , j^. Under some welfare functions, e.g. ?7o(7i5 • • • ) In) J2f=i log(7i) as in Him, 
the optimization problem ^ is convex, and thus easy to solve. If ^ is solved off-line, the designer 
can choose other welfare functions even though the resulting optimization problem is nonconvex. In the 
second step, we look for an intervention rule / such that /(p*) = and p* € £{f) (or {p*} = £{f)), 
given the target power profile p* obtained in the first step. 

The first step of solving the design problem dD, namely finding the target power profile p*, requires 
knowledge about the parameters in the model, hij, rii, and Pi for all i,j G Af U {0}. In Section IVl-A. 
we propose a method for the designer to estimate the relevant system parameters needed to solve <^ 
based on intervention. Note, however, that solving the problem ^ does not require the designer to know 
the details of the users' preferences on their SINRs since knowing the expressions for SINRs suffices 
to evaluate the welfare function and to check the incentive constraints. To highlight the informational 
advantage of our approach, let us consider a pricing scheme, in which each user i is charged 7rj(p) when 
the users choose a power profile p. Suppose that each user i has quasilinear preferences on its own SINR 
and payment which are represented by a utility function of the form nj(7i) — vTj. Then in order to find 
a pricing scheme that sustains a target profile p*, the designer needs to know Ui for all i G Af. Since a 
pricing scheme uses an outside instrument to influence the decisions of the users, the designer needs to 
know how the users value SINRs relative to payments, which is subjective and thus hard to measure. On 
the contrary, an intervention scheme affects the users through their SINRs, and thus the designer does 
not need to know how the users value their SINRs. 

In the subsequent discussion of this paper, we focus on the second step of solving the design problem 
dH), assuming that a target power profile p* has been found. That is, we aim to find an intervention rule / 
that (strongly) sustains p*, namely p* G £{f) (or {p*} = £{f )), and that satisfies /(p*) = 0. Since user 
i can guarantee a positive SINR by choosing a positive power, it is impossible to provide an incentive 
for user i to choose = using any intervention rule. Thus, we assume that p* G rijCOi^j]- 
that (/, p*) is an (intervention) equilibrium if p* G £{f) and /(p*) = 0. At an equilibrium, no user has 
an incentive to deviate unilaterally while the designer fulfills his design criteria. Thus, an equilibrium 
can be considered as a stable configuration of an intervention rule and a power profile. An equilibrium 
can be achieved following the procedure described below. 
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1) The designer chooses a target power profile p* and an intervention rule /. 

2) The users choose their power profile p, knowing the intervention rule chosen by the designer.^ 

3) The intervention device obtains a signal p(p) and chooses its power pQ = /(p). 

To execute the above procedure, we may consider an adjustment process (e.g., one based on best-response 
updates) for the users and the intervention device to reach an equilibrium (see Section ITVl-B). as well as 
an estimation process for the intervention device to obtain a signal (see Section |Vl-A). It is an implicit 
underlying assumption of our analysis that the time it takes to reach a final outcome (i.e., the duration 
of the procedure) is short relative to the time for which the final outcome lasts. This justifies that in 
our model, the users fully take into account interference from the intervention device that is realized 
at the final outcome when they make decisions about their powers. When a network parameter changes 
(e.g., some users leave or join the network, or move to different locations), a new target is chosen 
and the procedure is repeated to achieve a new equilibrium. Thus, our analysis holds as long as network 
parameters do not change frequently, whereas providing incentives using a repeated game strategy usually 
requires an infinite horizon and sufficiently patient players. 

Another important underlying assumption in our analysis is that the designer can commit to the 
intervention rule it chooses. Since Uq is increasing in each 7^ and each 7^ is decreasing in po, the 
designer prefers not to intervene at all, i.e., it prefers to choose po = 0. However, if po is held fixed at 0, 
the users will choose P. The role of the intervention device is to provide a punishment mechanism for 
the users to choose a desired power profile other than P; the device should choose a high power level if 
a deviation is detected. Without the designer's commitment to the intervention rule, the designer would 
choose Po = (e.g., by disabling the intervention device) regardless of the power profile chosen by the 
users. Predicting this behavior of the designer, the users would choose P, resulting in the same outcome 
as with no intervention. Therefore, in order for the proposed intervention schemes to provide incentives 
successfully, it is critical that the designer executes punishment as promised to make punishment credible. 
In practice, credibility can be achieved by programming the intervention rule in the intervention device 
and making the program difficult to manipulate. 

Finally, the benchmark is the welfare when there is no intervention device in the network, i.e., po is 
held fixed at 0. In this case, 7^ is always strictly increasing in pi, and thus P = (Pi, . . . ,Pn) is the 

*The intervention rule can be broadcast to the users, or learned by the users from experimentation. 
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unique NE of the game {J\f, {Vi)i^j\f, (7j(0, ■))i£j\r)-' Thus, the case of p* = P is trivial, because there 
is no need to use an incentive scheme in order to achieve the power profile P. Hence, our main interest 
lies in the case of p* 7^ P, although our analysis does not exclude the case of p* = P. The case of 
p* 7^ P arises in many situations, for example, when interference among users is strong and welfare is 
measured by the sum of utilities or the minimum of utilities lfnplll6ll[7l. 

B. Performance Characteristics 

Given a target power profile p*, there are potentially many intervention rules / that satisfy the design 
criteria p* G £{f) and /(p*) = 0. Thus, below we propose several performance characteristics with 
which we can evaluate different intervention rules satisfying the design criteria. 

1) Monitoring requirement: The minimum amount of information about the power profile that is 
required for the intervention device to execute a given intervention rule (assuming perfect estimation). 

2) Intervention capability requirement: The minimum intervention capability needed for the intervention 
device to execute a given intervention rule, i.e., suppgp /(p)- (Even though there is no intervention at an 
equilibrium, the intervention device should have an intervention capability Pq > suppgp /(p) in order 
to make the intervention rule / credible to the users.) 

3) Strong sustainment: Whether a given intervention rule strongly sustains the target power profile p*. 

4) Complexity: The complexity of a given intervention rule in terms of design, broadcast/learning, and 
computation. 

C. Classes of Intervention Rules 

Without loss of generality, we can express an intervention rule / satisfying /(p*) = as /(p) = 
[S'(p)]o'°' where [x]^ = min{max{x, a}, 6}, for some function g : V ^ ^ such that ^(p*) = 0. Also, 
since the designer desires to achieve p*, it is natural to consider functions g that increase as the users 
deviate from p*. Hence, we consider the following classes of intervention rules, 

( V N K -\^° \ 



for K = 1,2,.... We call an intervention rule / G Tk{v*) a Kth-order intervention rule with target 
power profile p*. As K becomes larger, the set Tk{p*) contains more intervention rules, but at the 

'This is true for any constant intervention rule, where po is chosen independently of the observation of the intervention device. 
This shows the inability of traditional Stackelberg games, where the leader (the intervention device) takes an action before the 
followers (the users) do, to provide incentives in our setting. 




(6) 
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same time complexity increases. Simple intervention rules are desirable for the designer, the users, and 
the intervention device. As K is smaller, the designer has less design parameters and less burden of 
broadcasting the intervention rule; the users can more easily learn the intervention rule and find their 
best responses during an adjustment process; and the intervention device can more quickly compute the 
value of the intervention rule at the chosen power profile. Thus, our analysis mainly focuses on first-order 
intervention rules, the simplest among the above classes. 

Let -Fa'(p*) (-F|-(p*)) be the set of all iiTth-order intervention rules that (strongly) sustains p*, i.e., 
^k{v*) = {/ G ^k{y>*) ■■ V* G £{f)} and Pj^ip*) = {/ G Fk{v*) ■ {p*} = £{f)}- We define the 
minimum power budget^ for a ii'th-order intervention rule to (strongly) sustain p* by 

PSa'(p*)= inf sup/(p), and PB^(p*) = inf sup/(p). (7) 

/eJ-K(p*)peP /e^K(p*)peP 

Thus, with an intervention capability Pq > PBk{v*) {Pq > PBf^'{p*)), there exists a A'th-order 
intervention rule that (strongly) sustains p*. We set PBk{p*) = +00 (Pi?|^-(p*) = +00) if there 
is no Kth-order intervention rule that (strongly) sustains p* (i.e., Tk{p*') (-?^'(p*)) is empty). Since 
Jk(p*) C J='K'ip*) for all K,K' such that K < K' , both PBk{v*) and PB%{p*) are weakly 
decreasing in K for all p*. This suggests a trade-off between complexity and the minimum power 
budget. Also, since Tl^{p*) C J/<(p*), we have PBk{v*) < PBj^{p*) for all K and p*. The difference 
PBf^{p*) — PBk{p*) can be interpreted as the price of strong sustainment in terms of the minimum 
power budget. 

IV. Analysis of First-Order Intervention Rules 
A. Design of Intervention Rules 

We consider first-order intervention rules of the form 



flip) 



^ai\pi-pl 

.i=l 



(8) 



Under a first-order intervention rule, the intervention device increases its transmit power linearly with 
the deviation of each user from the target power, \pi — | , in the range of its intervention capability. We 
call ttj the intervention rate for user i, which measures how sensitive intervention reacts to a deviation 
of user i. Let M{p*) = {i € J\f : p* < Pi}. Without loss of generality, we label the users in such a way 

^As to be shown later, for strong sustainment, we need Po to exceed a certain value. Thus, we actually mean "infimum" 
power budget by minimum power budget. 



January 19, 2013 



DRAFT 



12 



that i G AA(p*) if and only if i < N', where N' = \J\f{p*)\. Since the users have natural incentives to 
choose their maximum powers in the absence of intervention, we need to provide incentives only for the 
users in J\f{p*). The following theorem shows that when the intervention capability is sufficiently large, 
the designer can always find intervention rates to have a given target power profile p* sustained by a 
first-order intervention rule. 

Theorem 1: For any p* G ni(0)-fj]' P* ^ ^ifi) if ^^'^ o^ily if 

ai > ' ' (9) 

Pihio 

and 

^ {P^-Pt)(Z,^.h,^P^ + n^ ^^^^ 
Pihio 

for all i e M{p*). 

Proof: See 122 Appendix B]. ■ 
We can interpret Theorem [T] as follows. As hio is larger, intervention causes more interference to user 
i with the same transmit power, and thus the intervention rate a-i can be chosen smaller to yield the 
same interference. When ^ijPj + ''^i i^ l^rg^' interference to user i from other users and its noise 

power are already strong, and thus the intervention rate Oj should be large in order for intervention to be 
effective. Hence, ^io/(X]jyi ^ijP^ + "^«) interpreted as the effectiveness of intervention to user i. 

Without intervention, users have natural incentives to increase their transmit powers. Thus, as the target 
power for user i, p*, is smaller, the incentive for user i to deviate is stronger, and thus a larger intervention 
rate Ui is needed to prevent deviation. Note that (Pj — p*) is the maximum possible deviation by user 
i (in the direction where it has a natural incentive to deviate). The minimum intervention capability in 
the right-hand side of ([TOl l is increasing with the maximum possible deviation and the strength of the 
incentive to deviate while decreasing with the effectiveness of intervention. 

A first-order intervention rule // satisfying the conditions in Theorem [T] may have a NE other than the 
target power profile p*. For example, if Pq < ^ji^j ~ P^) ^ ^ -^(p*)> P is also sustained 

by fl- The presence of this extra NE is undesirable since it brings a possibility that the users still choose 
P while the intervention device causes interference to the users by transmitting its maximum power 
Pq- Obviously, this outcome (PqjP) is worse for every user than the outcome at the unique NE without 
intervention (0, P). In order to eliminate this possibility, the designer may want to choose an intervention 
rule that strongly sustains the target power profile. The following theorem provides a sufficient condition 
for a first-order intervention rule to strongly sustain a given target power profile. 
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Theorem 2: For any p* € \{i{<d,Pi], {p*} = £{fl) if 



«^ > - E - + .T' (11) 



and 



(12) 



for all i G AA(p*).'' 

Proo/; See US Appendix C]. ■ 
By comparing Theorems [J and |2] we can see that the requirements for the intervention rates and the 
intervention capability is higher when we impose strong sustainment. For any given power profile, the 
intervention rates can be chosen sequentially to satisfy the condition ([iTb starting from user N' down 
to user 1. We can set = for all i ^ A/^(p*). Unlike Theorem [T] the choice of the intervention rates 
affects the minimum required intervention capability. For strong sustainment, the intervention capability 
is required to be larger as the designer chooses larger intervention rates. A main reason for the existence 
of an extra NE is that the region of power profiles on which the maximum intervention power is applied 
is so wide that the users cannot escape the region by unilateral deviation. Thus, a larger intervention 
capability is needed to reduce the region and guarantee the uniqueness of NE. 
From Theorem [T] we obtain 

PBi{p*) = max ^ . (13) 

Since Theorem |2] gives a sufficient condition for strong sustainment, we obtain an upper bound on 
P5f(p^), 



N 



PB\{p*) = Y, 



, 7=1 P'i I Pi^^o 



(14) 



Note that PBi{p*) < PBi{p*) with equality if and only if A^' < 1. Combining these results, we can 
bound PBf{p*) by 

Pi?i(p^) < PBUp*) < PP^(p^). (15) 

By Theorems [U and m we know that first-order intervention rules can sustain the set nj(0)^j]- Note 
that among all the efficient power profiles, those with pi = for some i have probability measure zero. 

'We define X^jg j = if J is empty. Similarly, we define Hje j Xj = 1 if J is empty. 



January 19, 2013 



DRAFT 



14 



Hence, we obtain almost the entire set of feasible payoffs by using first-order intervention rules. As 
argued in Section Illl it is impossible to provide an incentive for user i to choose pi = by intervention 
rules of any orders. Thus, we actually obtain the largest set of sustainable payoffs by using first-order 
intervention rules. This discussion suggests that the potential gain from using higher-order intervention 
rules is not from what they can sustain but how they sustain a target power profile. 
Remark 1: An extreme intervention rule lIlTI . defined by 

0, if p = p^ 



/i(p) 



(16) 



can be considered as the limiting case of first-order intervention rules as each Oj goes to infinity in that the 
area of the region {p : (p) ^ // (p)} approaches zero in the limit. With this class of intervention rules, 
we have £{f^) = {p*,?} if ^ PBi{p*) and f(/e) = {P} otherwise. Therefore, it is impossible 
to construct an extreme intervention rule that strongly sustains a target power profile, except in the 
uninteresting case p* = P. This motivates us to study intervention rules other than extreme intervention 
rules. 

B. Dynamic Adjustment Processes 

Previously, we have derived the conditions under which a target power profile is (strongly) sustained. 
Now we propose a dynamic adjustment process, in order to guide the intervention device and the users to 
achieve the target power profile as NE. The adjustment occurs at discrete steps, labeled as t = 1, 2, . . .. 
We allow the use of different intervention rules in different steps. Thus, the beginning of each step is 
triggered by the intervention device's announcement of the intervention rule to be used in that step. Users 
are synchronized by the announcement of the intervention rules. The adjustment process is based on the 
myopic best-response updates of the users and is described in Algorithm [T] 

During the adjustment process, the designer may use different intervention rules, as well as intermediate 
target profiles different from the final target p*. That is, we have /* € Ti (p*), where p* is the intermediate 
target power profile at step t. In the adjustment process, the designer chooses a sequence of intervention 
rules. Suppose that the designer uses an update rule tp : V ^ Ti to determine an intervention rule based 
on the power profile in the previous step. Then given an initial power profile p*^, an update rule yields 
a sequence of intervention rules and power profiles {(/*, p*)}^!-'*^ We can evaluate an update rule by 

'"since the best response correspondence is non-singleton only in knife-edge cases, we focus on update rules that yield a 
deterministic sequence. 
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Algorithm 1 A dynamic adjustment process. 



1: Initialization: t = 

2: The users choose an initial power profile p*^. 

3: The designer announces the use of first-order intervention rules /* G Ti with power budget Pq. 
4: while p* / p* or /*(p*) 7^ do 

5: t-^t + l 

6: Given p*^^, the designer chooses and broadcasts the intervention rates and the target power 
profile p* for the current time slot t. 



Given /*(p) = '^l \Pi ~ Pi\ > ^^^^ user i chooses a best response to p!_/: 



t-1. 



8: € BR,{f, p*_/) ^ argmaxp,gp, 7i{f,Pi, P*-/)- 

9: end while 



the following two criteria. 

1) Convergence: Does the induced sequence reach an equilibrium (/, p*)? If so, how many steps are 
needed? 

2) Minimum power budget: How much power budget is needed to execute i.e., sup^ suppg-p /*(p)? 

The following theorem shows that when the target power profile p* is close to the maximum power 
profile P and the intervention capability Pq is large, we can obtain fast convergence as well as strong 
sustainment. 

Theorem 3: For any p* € 11^(0, Pi] such that Xlili(^i -Pi)/Pi < 1. there exists (ai, . . . , oat) G 
such that 

> — > aj{Pj -Pj)+ ^, (17) 

for all i £ M{p*). Suppose that ff G 7'i(p*) satisfies ([T7]l and 

^0 > — > aj{Pj -Pj) + (18) 

Pi ^ P*hio 

for all i G J\f{p*). Then {p*} = £{fi)- Moreover, starting from an arbitrary initial power profile p*^ 7^ p*, 
the adjustment process with /* = f( for all t = 1,2,... reaches (/^, p*) in at most two steps (and in 
one step if pO g \{i\pt,Pi])- 

Proof: See US Appendix D]. ■ 
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The minimum power budget required to execute an intervention rule described in Theorem |3] is given 

by 

PB.ip*) = E pt • (19) 

Since the requirement for Oj in (fTTl ) is more stringent than that in ([TTI l. we have PSi(p*) > PB\{p*) 
with equality if and only if N' < 1. The difference PB i{p*) — PBl{p*) can be considered as the 
price of fast convergence to p* in terms of the minimum power budget. In addition to requiring a larger 
power budget, Theorem |3] imposes a restriction on the range of target profiles. That is, the target should 
be close enough to P for the result of Theorem |3] to hold. However, the target may not satisfy the 
restriction X]i^i(-f« ~Pi)/Pi < 1- I^i this case, the designer needs to use intermediate target power 
profiles that are successively close to one another in order to guide the users to the final target. The 
use of intermediate target power profiles is also necessary when the intervention device does not have a 
large enough power budget to strongly sustain a target power profile. In this case, the designer can use 
a sequence of intervention functions to drive the users to reach the target power profile as the unique 
outcome. This process requires smaller power budget than that required by strong sustainment, but may 
take longer time for the system to reach the target power profile. 
Define the relative distance from p to p' by 

N _ , 

^(P'P') = E^- (20) 

Using the proofs of Theorems |2] and |3] we can show that, given p*~^, the designer can achieve the 
intermediate target at step t, i.e., p* = p* only if p* satisfies d(p*~^,p*) < 1. This imposes a bound 
on the relative distance between two successive intermediate targets. Below we provide two different 
methods for the designer to generate intermediate targets. The first method, which is summarized in 
Algorithm |2] and analyzed in Theorem lU produces a sequence of intermediate targets reaching the final 
target whose successive elements have a relative distance of 6 € (0, 1) while requiring the minimum 
power budget in each step. This method can be used in a scenario where the power constraint of the 
intervention device does not bind; the designer can fix 6 sufficiently close to 1, and the method will 
allow the system to reach the final target in the minimum number of steps. The second method, which is 
summarized in Algorithm |3] and analyzed in Theorem |5] yields a sequence of intermediate targets with 
the largest relative distance in each step while satisfying the power constraint. Thus, this method will 
allow the manager with a limited power budget to reach the final target as fast as possible. 
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Now suppose that a fixed relative distance between two successive targets is given. Algorithm |2]provides 
a method for the designer to generate intermediate targets in the most power-budget efficient way. 

Algorithm 2 An algorithm that generates a sequence of intermediate target power profiles with a fixed 

relative distance. 

Require: Fix 6 G (0, 1) close to 1 

1: Initialization (t = 1): Set = P {This step can be skipped if p" = P} and M = J\f 

2: Set fli = p*/p] for all i E 7\A 
3: while T.f=M-P*)/p\ > 1 do 

4: t^t + l 

5: while iii<N-5Ao 

6: Choose i* G arg maxjg^K — ^^^^ - 

7: Set = min{l, N - 6 - /"jl and ^ \ {i*} 

8: end while 

9: Set pI = p*/ni for alH G AT 

10: end while 

11: Set p* = p* 



The following theorem shows that the designer can lead the users to the final target by using interme- 
diate targets generated by Algorithm |2] provided that the intervention capability is sufficiently large. 

Theorem 4: For any p* € ni(0'-f'i]' if S > 1 — mini(p*/Pj), then Algorithm |2] temiinates at a finite 
step T with T <N' + l. Let (p*)^i be the sequence of power profiles generated by Algorithm |2] Then 
there exists a sequence of intervention rules with /* € /'i(p*) such that the adjustment process 

with yields p* = p* for all t = 1, . . . , T starting from any p*' € V. 

Proof: See |[23l Appendix F]. ■ 

Now we consider a scenario where the intervention capability Pq should be taken into consideration 
while generating intermediate targets^^. In this scenario, in order to induce the users to follow intermediate 
targets during the adjustment process, the intermediate targets (p*)^2 should satisfy not only 

A' ~t-l ~t 

EV^*^ (21) 

"Note that to sustain p* as the NE, we require Po to satisfy the conditions in Theorem [T] 
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but also 



Pq = max < 



iGA^(p*) 



+ ^ rr; ^ ^ > < Po (22) 



for all i = 2, . . . , T. In order to reach the final target in the minimum number of steps, we need to 
maximize the relative distance between successive targets while satisfying the constraints dlTb and (|22] |. 
Thus, the problem to obtain p* given p*^^ can be written as 

N _ 

maxp* J]^L_^ (23) 

i=l Pi 



s.t. max { ^^^i^T^ + ''7 b'r^ } < Pq - ei (24) 

iempn I Pt^ 1 _ ^^^^^ pI~^ ' 



N _t_i ,t 

EV^-'-^^ (25) 
j=i ■'^j 

P* < P* < P*"' (26) 

for small ei , £2 > 0. Algorithm |3] formalizes the method to generate a sequence of intermediate target 
power profiles, which has maximal relative distances (MRD) between successive target power profiles 
given an intervention capability Pq. We call the sequence generated by Algorithm |3] the MRD sequence. 
Note that the major complexity in solving the above problem is line [10] in Algorithm |3] This search on 
p\, can be done efficiently by bisection method, because Pq is decreasing with p*. . 

Given the power budget Pq, we are interested in the minimum convergence time for the dynamic 
adjustment process to reach the target power profile p*, defined by T*(p*) = inf [T : = p*}, where 
the infimum is taken over the set of sequences satisfying (|2TI ) and (l22l) starting from p^ = P. In order 
to obtain an upper bound for T*(p*), we use an upper bound for the convergence time of a geometric 
sequence of T intermediate target power profiles in the following form: 

pl = {r,iY-^P„ y i, t = l,...,T, (27) 

where r]i = {pyPi)"^^ , i = l,...,N. 

Theorem 5: If pO p*, E^=i(^i - Pi) /Pi > 1' and 

Pq > max - 1 max ^^ , ^ , (28) 

V » Pi / » hio 

then T*(p*) > 2 and r*(p*) satisfies 

N 



T*(p*)-2 1 

^\PI <N-1 + -, (29) 

i=i ' 
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Algorithm 3 An algorithm that generates a sequence of intermediate target power profiles with maximal 
relative distances given an intervention capability. 

Require: Small ei G (0, 1) and £2 € (0, 1); Pq - ei > max | ~J^' ■ ^j^^^'^^^ 

i [ Pi hio 
1: Initialization {t = 1): Set = P {This step can be skipped if p*' = P} 

2: while p* / p* do 

3: t^t + l, M = {i: v\~^ > p*}, p* = p*-^ 

4: repeat 

5: i* = mm^M bl~\ pI = max {p*., (A^ - 1 + £2 - Y^i^i- ' P*"^} 

6: calculate Pq* by (EUl 

7: if Pq* < Po - ei then 

8: M^M\{i*} 

9: else if Pq > Po - ei then 



10: find pj, € max 

11: end if 

12: until Pq = Po - ei or M 

13: end while 

14: Set p* = p* 



[p*, , (A - 1 + 62 - pVpI~') ' pI~'} ,P^'] SUCh that P* = Pq - 6l 



where 

C = ^ ■ ^° + (30) 

maxj — • maxj -r ii^ax, 

Proof: See 112 Appendix G]. ■ 
The inequality (l29l ) provides an upper bound for r*(p*), since the left-hand side of (l29l ) increases 
in r*(p*) and approaches N as T*(p*) — > 00 while the right-hand side is smaller than N given (|28] ). 
From Theorem |5] we can see that the convergence time is small if the power budget Pq is large, the 
target power profile is close to the maximum power (i.e. maxj is small), or SINR is relatively small 
compared to the channel gain from the intervention device (i.e. maxj is small). 

C. Relaxation of Monitoring Requirement 

The results in this section so far relies on the ability of the intervention device to estimate individual 
transmit powers. However, estimating individual transmit powers requires larger monitoring overhead for 
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the intervention device than estimating aggregate receive power In order to study intervention rules that 
can be executed with the monitoring of aggregate receive power, we consider a class of intervention rules 
that can be expressed as 



"0 



TV 



^ hoiPi - Pa 



Po 

(31) 





for some uq > 0, Pq > 0, and p^. We call an intervention rule in this class a first-order intervention rule 
based on aggregate receive power or, in short, an intervention rule based on aggregate power We call 
ao the aggregate intervention rate, and call p\ the target aggregate power, which is set as the aggregate 
receive power at the target power profile, i.e., p*^ = Xli^i hoip*. We first give a necessary and sufficient 
condition for an intervention rule based on aggregate power to sustain a target power profile. 
Theorem 6: For any G Uii^^Pi]^ P* ^ Siff) if and only if 

ao > max — (32) 

ieJV{p*) h'OiPinio 

and 

iPi-pmj:,^^h,,p^j+n^) 

Pq > max . (33) 

i6AA(p*) Pi^iO 

Proof: See El Appendix H]. ■ 
The minimum intervention capability required to sustain a target profile is not affected by using 
aggregate receive power instead of individual transmit powers. However, the aggregate intervention rate 
should be chosen high enough to prevent a deviation of any user, whereas with the monitoring of individual 
transmit powers the intervention rates can be chosen individually for each user This suggests that strong 
sustainment is more difficult with intervention rules based on aggregate power For example, P is also 
sustained by ff if Pq < ao J2jjtii^0jPj ~Pj) for all i € J\f{p*), which is weaker than the corresponding 
condition in the case of intervention rules based on individual powers, Pq < '^j^iO!j{Pj — p*) for all 
i G J\f{p*). With the monitoring of individual powers, a deviation of each user can be detected and 
punished. This leads to the property that the best response of user i is almost always either p* or 
Pi under first-order intervention rules based on individual powers. This implies that a power profile 
sustained by a first-order intervention rule based on individual powers almost always belongs to the set 
]^j{p*,-Pj}. In contrast, with the monitoring of aggregate power, only an aggregate deviation can be 
detected. This yields a possibility that an intervention rule based on aggregate power sustains a power 
profile that is different from the target but yields the same aggregate power This possibility makes the 
problem of coordination failure more worrisome because if the users are given only the target aggregate 
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power p*^ they may not know which power profile to select among those that yield the aggregate power 
p\.^^ The problem arising from the increased degree of non-uniqueness can be considered as the cost 
of reduced monitoring overhead. To state the result formally, let Oq = (Xljyj^iiPj + ^i)/(^OiJ'i^^io) 
and P(5 = {Pi - P*){Yjj^ihijP] + ni)/{p^hiQ) for all i G A/'(p*). Also, let ao = max .^^^p, ^ and 

Theorem 7: Suppose that, for p* G Wii'^^ Pi]^ there exist i,j € M{p*) such that (i) ao = > oPq or 
al.ai < ao, and (ii) Pq = P^ > P^ or P^,P^ < Pq. Then for any ff such that p* € £{ff) and for 
any e > 0, there exists p / p* such that p G £{ff), Y^iLi ^OiP* = Y^iLi hoiPi, and |p - p*| < e. 

Proof: See El Appendix I]. ■ 

Theorem |7] provides a sufficient condition under which the strong sustainment of a given target power 
profile is impossible with intervention rules based on aggregate power. We argue that the sufficient 
condition is mild. First, note that, for almost all p* G ni(0>-f«]' c^o'^ and Pq's can be ordered strictly. 
With strict ordering of Og's and Pq's, we can always find a pair of users i,j G M{p*) satisfying the 
condition in Theorem|7]if there are at least three users in M{p*). That is, strong sustainment is generically 
impossible with intervention rules based on aggregate power when |A/^(p*)| > 3. 

V. Implementation Issues 

In this section, we discuss some implementation issues. First, we provide algorithms for the intervention 
device to estimate the system parameters and individual transmit powers without user cooperation. Then 
we compare the communication overhead of the intervention scheme with that of other incentive schemes. 

A. Estimation of System Parameters and Individual Transmit Powers 

As we have seen from previous results (e.g. Theorems [I]-|3), in order to determine the intervention 
rates and the power budget requirement, the designer needs to know the normalized cross channel gains 
{jr^ljLi j^ii ^ the normalized noise powers Vi G TV, the maximum power levels Pj, Vi G TV, 
and the target power profile p*. We propose a method to estimate the normalized cross channel gains, 
normalized noise powers, and the maximum power levels without user cooperation. Based on the above 
parameters, the designer can determine the target power profile p* by solving (O. In addition, we propose 
a method to estimate the individual transmit powers without user cooperation. 

'^A way to overcome this problem is to broadcast tlie target power profile p* to the users in order to make p* as a focal 
point l24l . 
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1) Normalized cross channel gains and noise powers: The designer estimates the normalized cross 
channel gains and the normalized noise powers by adjusting the intervention rules and observing the 
reaction of the users. First, the designer broadcast the intervention capability Pq ^^d a temporary target 
power profile p < P. Then it makes N rounds of measurements at different locations. We assume 
that during the measurements, the users always choose the power levels that maximize their SINRs given 
the intervention rule. Thus, we exclude the strategic behavior of the users to influence the measurements 
in their favor. We also assume that the intervention device can move its receiver to N different locations, 
or it has N receivers located at different locations. 

Algorithm 4 The nth round of measurement performed by the intervention device. 
Require: Error tolerance e > 

1: Initialization: Broadcast aj = 0, Vj € M 

2: for index = to — 1 do 

3: Set i = {n + index) mod A^, = 0, Oj = 0, any positive value (preferably large) 

4: Measure the aggregate receive power Ri and set the current aggregate receive power Ri = Ri 

5: while Qi = or cij — > e do 

6: if ai = and Ri = Ri then 

7: tti ^ 2 • a, 

8: else if Ri = Ri then 

9: ttj ^ tti, ai (Oj + a.i)/2 

10: else 

11: o-i -(r- ai, ai ^ (oj + ai)/2 

12: end if 

13: Broadcast the index i and the new ai, and measure the current aggregate receive power Ri 

14: if Ri < Ri then 

15: Set R^ = Ri 

16: end if 

17: end while 

18: end for 



In round n, the designer adjusts the intervention rates one by one, starting from a„ to a^, then from 
ai to a„_i. All measurements are made at the receiver at location n. When adjusting ai, the aim is to 
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find a*, the minimum intervention rate at which user i's best response is pi. We can calculate al as 

(34) 



a,- 



i > n 



hio 

The designer tunes ctj to find a* according to the change of the aggregate receive power. When > , 
user i's best response is Pi, and the aggregate receive power at location n is 



n 



(35) 

[ E ■=! K,P, + E}=„ /^o.P. + Ef=. /^o,^'. + ^0, ^ > n 
where h^- is the channel gain from user j's transmitter to the intervention device's receiver at location 
n. When Oj > a|, user i's best response is the target power profile pi, and the aggregate receive power 
decreases to 

m = { . (36) 



y:;zI h^o.Pj + E;=n ^s.^^. + Ef=.+i /^s,^'. + ^o, ^ > 



n 



During the measurement, the designer maintains an upper bound Oj, at which the aggregate receive 
power is R^, and a lower bound a^, at which the aggregate receive power is Ri. By bisection methods, 
an estimate of a*, denoted by nini, is obtained within the error tolerance e. The nth-round measurement 
is summarized in Algorithm H] 

Round n returns a set of measurements from which we obtain maximum power levels 

{Pi]f=i- Note that BJ^ - Rf = Ki ■ {Pi - Pi) for all i,neJ\f. Thus, we have 

nn nn 

hl=hl^-^r^^ Vi,nGAr. (37) 

K; — Hi 



Since Rl^^ = Zf=i KjPj (M = when n = 1), using the above relationship between /iq. and h^^, 
we have the following N linear equations 

^ ( RP' — i?" \ 

E ^Oi • (^ ^■i_g p.J = n = 1, . . . , AT, (38) 

which we can solve for {/lojl^Li- Given {/iQ^jj^j^, we can calculate {Pi\f^^ by using R^l — Rj = 

hl.-{Pi-pi),yieM. 

Round n also returns another set of measurements m„ = [m„i, . . . , m„j, . . . , m„jv], where we 
assume m„j = a*. Given the measurements {m„}^^^, we can obtain the normalized cross channel gains 
and the normalized noise powers. Specifically, from m„ and m„+i,V n < N, we have 

m„+i,i - nini = r , V z / n, (39) 

n-io 
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from which we can get ^ = (m„+i,j - mni)/{Pn - Pn)y i / n. 

To sum up, we can get according to m„ and m^+i for all n < N, and get {^^}j^7v according 

to mjv and mi. Now that we know all the normalized channel gains, we can get the normalized noise 
powers ^ from (l34l ). 

2) Individual Transmit Powers: The byproduct of the above estimation is the channel gains from 
the users to the intervention device at N different locations: {^oii^i'"- ~ 1, • • • ,iV. At any time, the 
intervention device can measure the aggregate received power at all the N locations 

N 

^K,P,+nl (40) 

where rig is the noise power known to the intervention device. Since the designer knows the values of 
N different linear combinations of the N individual transmit powers, it can solve the group of N linear 
equations to obtain the individual transmit powers. The complexity of this operation is of the order A^^. 

B. Comparison of Communication Overhead 

Now we compare the communication overhead of the intervention scheme with that of other frame- 
works, including network utility maximization ||4]||5], game theoretic control based on taxation ll8l- |[T3l . 
and mechanism design |[T4ll[T5l . Before the comparison, we would like to emphasize that the intervention 
scheme works for selfish users, who have no incentive to provide any information to anyone else. As 
shown in Table JIl intervention requires no information flow from the users to the designer. On the 
contrary, the other works assume that the users are obedient to exchange information with the designer 
or among each other according to some prescribed rules. 

The communication overhead is measured by the total amount of information flow before the system 
reaches the desired operating point. Specifically, the amount of information flow is measured by the 
number of real numbers broadcast by the users and the intervention device, ignoring quantization and 
coding. The communication overhead can be further categorized into the communication overhead on the 
users and that on the designer. We summarize the comparison in Table |II1 In each framework, we select 
representative algorithms to calculate the communication overhead. Hence, the numbers in the table are 
not precise for all the algorithms, but are correct in terms of the order. 

We can see from Table HI] that in intervention, users have zero communication overhead. In other words, 
the intervention device does not rely on the users to provide information. While in other frameworks, 
users may be required to broadcast some information. Hence, intervention is more suitable when users 
are selfish and unwilling to provide information truthfully. Another advantage of intervention is that its 
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TABLE 11 

Comparison of Communication Overhead of Different Frameworks 



FrLinicwork 


Users 


Xhe Designer 


Overall 


Intervention 





2N^ log2(l/e) + 2iV-r*(p*) 


2N^ log2(l/e) + 2Af ■r*(p*) 


Network utility maximization 


(ID) or (El) 

in each step 


(HI) or N (0) 

in each step 


A'' in each step 


Game theoretic control 
based on taxation 181-1131 


A'^ in each step 


A'^ in each step 


2N in each step 


Mechanism design I14II15I 


A'^^ in each step 


TV in each step 


7V^ -\- N m each step 



communication overhead can be bounded analytically. While in other frameworks, the convergence speed 
to the desired operating point is not guaranteed. The detailed analysis is as follows. 

1 ) Intervention: The communication overhead of intervention is comprised of two parts. The first part 
is the overhead of estimating normalized cross channel gains and noise powers. The temporary target 
power profile p and the intervention capability Pq is broadcast at the beginning, which we omit here. 
There are rounds of measurement. In each round, the intervention device needs to broadcast the initial 
intervention rates (line [T] in Algorithm |4), and the indices and the values of the intervention rates it is 
adjusting (line [13] in Algorithm HJ). Since it uses bisection methods to adjust the intervention rate, the 
number of adjustments is log2(l/e) for each intervention rate, where e is the eiTor tolerance in estimating 
intervention rates in Algorithm 5] In sum, the communication overhead in estimating system parameters 
is N{2Nlog2{l/e) + I) ^ 2iV2 ^Qg^^i/g) for intervention rates. 

The second part is the communication overhead during the convergence to NE. In general, this overhead 
depends on the target power profile and the intervention capability. If the target power profile is close to 
the maximum power level iJ2iLi(^i ~V^)/Pi < 1) ^nd the intervention capability is large (Theorem |3), 
the convergence is in one step. If the target power profile is not close enough or the intervention capability 
is limited, we need to use the dynamic adjustment process. The convergence is in A^' + 1 steps if the 
intervention capability is large (Theorem^), and is in r*(p*) steps if the intervention capability is limited 
(Theorem |5]). In each step, the intermediate target power profile and the intervention rates are broadcast. 
In summary, the worst-case communication overhead during the convergence to NE is 2N ■ T*(p*). 

2) Other frameworks: The communication overhead of other frameworks is the total information flow 
during the convergence to the desired operating point. Unlike intervention, the number of steps during 
the convergence is not bounded. Hence, the communication overhead could be arbitrarily large. 
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Fig. 1. An example single-hop wireles.s ad-hoc network with two users. 

If users cooperate with the designer to maximize the assigned utiUty function, as in ||4l||5l and |[8l- |[T3l . 
the information flow in one step can be less than that in intervention. In network utiUty maximization, 
either each user or the base station will broadcast at least N signals for all the users. In (4], each user 
broadcast its own interference price. In ||5], the base station broadcast "load" and "spillage". In game 
theoretic control based on taxation ll8l- llT3l . the base station broadcast N prices, usually different for 
different users. To obtain the optimal pricing, users report back some information, such as their payoffs 
or interferences, to adjust prices. 

In mechanism design |[T4ll[T5l . similar to intervention, the designer does not know the utility function 
of each user. In this case, the information flow in one step is of the same order as that in intervention. 
Specifically, each user broadcast its own version of optimal power allocation vector in each step, and 
the designer broadcast the reference power allocation vector. Hence, the amount of information flow is 
A^^ + in each step. 

VI. Simulation Results 

We consider a two-user network shown in Fig. [T] User 2's transmitter is near to user 1 's receiver, causing 
significant interference to user 1. The distance from user I's transmitter to its receiver is normalized to 1. 
Originally, the distance from user 2's transmitter to its receiver is 0.5. The vertical distance between the 
two users' transmitters and that between the two users' receivers are both 0.5. Without specific notice, 
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Distance between the transmitter and receiver of user 2 



Fig. 2. The optimal social welfare achievable by intervention and the social welfare at NE without intervention, when user 2's 
transmitter moves away from its receiver. 



we assume that the positions of the transmitters and receivers of both users remain the same. In the 
simulation of Fig. |2j we let user 2's transmitter move away from its receiver as shown by the dashed left 
arrow, resulting in less interference to user 1. The channel gain h is reciprocal to the distance d with the 
path loss exponent a, namely h = d~°'. We assume an indoor environment where a = 3 ||25l . The noise 
powers at the receivers of both users are 0.2. The power budgets of both users are 10. 

A. Performance Improvement By Intervention 

Now we examine the performance improvement by using intervention mechanisms. We let user 2's 
transmitter moves away from its receiver. In Fig. we show the performance achieved by intervention 
and that at the NE without intervention, under two criteria for social welfare. The sum rate is define by 

log(l + 7i) + log (1 + 72), (41) 

and the fairness is defined by the "max-min" fairness lH] pp. 3921ll5l l[26l 

mill {log (1 + 71), log (1 + 72)}. (42) 

As we can see from Fig. |2] the sum rate achievable by intervention doubles that at the NE without 
intervention in all the cases. The fairness achievable by intervention is much larger than that at the NE 
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Target power for user 1 Target power for user 1 

(c) (d) 

Fig. 3. Contour of the minimum power budget of first-order intervention under different target power profiles, (a): minimum 
power budget to sustain a target power profile obtained by Theorem [T] and Theorem |6l (b): minimum power budget to strongly 
sustain a target power profile obtained by simulation; (c): upper bound on the minimum power budget to strongly sustain a 
target power profile obtained by Theorem |2] (d): upper bound on the minimum power budget for strong sustainment and fast 
convergence obtained by Theorem |3] 

without intervention in most cases. When the distance from user 2's transmitter to its receiver is 1.0, the 
network is symmetric. Only at this point is the NE without intervention optimal in terms of fairness. 

B. Minimum Power Budget 

Now we show the power budget requirement for different intervention rules. In Fig. [3j we show the 
contour of the minimum power budget for different intervention rules under different target power profiles. 
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Fig. 4. Given the relative distance between adjacent target power profiles, the convergence time and the power budget requirement 
of different sequences of intermediate target power profiles in a five-user network. The relative distance between the maximum 
power profile and the target power profile is Xii^i ^'j ~ 3.6 > 1. (a); convergence time; (b): power budget requirement. 



Fig. | 3Ia)| shows minimum power budget to sustain a target power profile using first-order intervention 
based on individual transmit powers obtained by Theorem [T] and that using first-order intervention based 
on aggregate receive power obtained by Theorem |6] Since the power budget requirements are the same 
for these two intervention rules, we show them in the same figure. Fig. | 3i^b)| shows the minimum power 
budget to strongly sustain a target power profile obtained by simulation. As we expect, the power budget 
requirement for strong sustainment is higher. Fig. |3 Kc)| shows the upper bound on the minimum power 
budget to strongly sustain a target power profile obtained by Theorem |2] We can see that the result in 
Theorem |2] serves as a good upper bound. Finally, Fig. |3 |{d)| shows the upper bound on the minimum 
power budget for strong sustainment and fast convergence obtained by Theorem |3] In this case, the system 
reaches NE in at most two time slots. To achieve this fast convergence, the intervention device needs 
a much higher power budget. In addition, not all the target power profiles can be sustained. The target 
power profiles that cannot be sustained lie in the shadow area in the figure. 

C. Power Budget and Convergence Time Tradeoff In Dynamic Adjustment Process 

Now we study the tradeoff between the power budget and the convergence time in the dynamic 
adjustment process. Here the convergence time is measured as the number of steps in the adjustment 
process. To better illustrate the tradeoff, we use a five-user network in the simulation for Fig. |4]and Fig. |5] 
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Fig. 5. Given power budget requirements, tlie convergence time of different sequences of intermediate target power profiles in 
a five-user networlc. Tlie relative distance between the maximum power profile and the target power profile is Yl^=i ^'p-^' ~ 
3.6 > 1. 



The channel gains and noise powers used in the simulation are one realization of the random variables. 
Since different realizations result in similar tradeoff, we only show the results for one realization. The 
target power profile is = Pi and p* = O.lPj for i > 1. Since the relative distance between the 
maximum power profile and the target power profile is Yli=i ^'p.^' = 3.6 > 1, we cannot reach the 
target power profile from the maximum power profile directly u sing Theorem |3] Instead, we need a 
sequence of intermediate target power profile before we reach the final target power profile. 

First, suppose that there is no power budget requirement. Without power budget limit, our goal is to 
reach the final target power profile in as few time slots as possible. Since it is not easy to construct a 
sequence of intermediate target power profiles given a desired convergence time, we construct the sequence 
according to the desired relative distance between adjacent intermediate target power profiles, which is 
an indicator for the convergence time. In Fig. HI we show the convergence time and the power budget 
requirement of the MRD sequence generated by Algorithm |2] and the geometric sequence under different 
relative distances. We can see from Fig |4 |[a)| that a larger relative distance results in a faster convergence 
for both sequences. Thus, we can use the relative distance, a metric amenable for the construction of the 
sequence, to control the convergence speed of the adjustment process. In particular, when the relative 
distance is(5 = 0.9 = l — miiij {p* /Pi}, the MRD sequence converges in A^' + 1 = 5 steps as predicted by 
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Theorem |4] In Fig |4 ||^b)[ we can see that for both sequences, the power budget requirement is decreasing 
with the relative distances in most cases. The power budget is lower for the MRD sequence. For both 
sequences, it requires much less power by the dynamic adjustment process than by the strong sustainment 
condition in Theorem |2] 

Second, suppose that there is a power budget requirement. Given different power budget requirements, 
we show the convergence time of MRD and geometric sequences and the upper bound on the convergence 
time in Fig. [5] We can see from the figure that under most power budget requirements, the convergence 
time of MRD sequence is roughly half of that of the naive geometric sequence. When the power budget 
is small (near to the minimum power budget that sustains the target power profile), the fast convergence 
of MRD sequence is even more significant compared to that of the geometric sequence. When the 
power budget approaches the minimum requirement that strongly sustains the target power profile, the 
convergence time of MRD sequence is 5, which is half of that of the geometric sequence. 

VII. Conclusion 

In this paper, we proposed incentive schemes based on intervention for power control in single- 
hop wireless ad hoc networks with selfish users. We formulated a game-theoretic model of power 
control with an intervention device and proposed design criteria that desirable intervention rules should 
satisfy. Focusing on a simple class of intervention rules called first-order intervention rules, we provided 
requirements for intervention rules to sustain a target power profile when the intervention device estimates 
individual transmit powers or aggregate receive power. We also proposed dynamic adjustment processes 
to guide users to a target power profile through intermediate targets. We discussed implementation issues 
and presented simulation results. To the best of our knowledge, this work is the first to investigate 
intervention schemes in a power control scenario. For future research, we can apply intervention schemes 
to different power control scenarios, for example, a scenario where users can allocate their power budgets 
across multiple channels and a scenario where users care about their energy consumption as well as their 
data rates. 
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